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Abstract 

We study the empirical measure La„ of the eigenvalues of non-normal 
square matrices of the form A„ = ?7„Z)„V„ with V„ independent Haar dis- 
tributed on the unitary group and D„ real diagonal. We show that when the 
empirical measure of the eigenvalues of D„ converges, and D„ satisfies some 
technical conditions, La„ converges towards a rotationally invariant measure 
on the complex plan whose support is a single ring. In particular, we provide 
a complete proof of Feinberg-Zee single ring theorem Q. We also consider 
the case where Un,Vn are independent Haar distributed on the orthogonal 
group. 

1 The problem 

Horn lUSl asked the question of describing the eigenvalues of a square matrix with 
prescribed singular values. If A is a n x n matrix with singular values si > .. . > 
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Sn > and eigenvalues , . . . , ?i„ in decreasing order of absolute values, then the 
inequalities 



k k 



n n 



JJ^ < JJ^^^j, if ^ < « and 



(1) 



7=1 7=1 



7=1 7=1 



were shown by Weyl [|25l to hold. Horn established that these were all the rela- 
tionships between singular values and eigenvalues. 

In this paper we study the natural probabilistic version of this problem and 
show that for "typical matrices", the singular values almost determine the eigen- 
values. To frame the problem precisely, fix 5i > . . . > i',, > and consider n x n 
matrices with these singular values. They are of the form A = PDQ, where D is 
diagonal with entries sj on the diagonal, and P, Q are arbitrary unitary matrices. 

We make A into a random matrix by choosing P and Q independently from 
Haar measure on U (n), the unitary group of nx n matrices. Let . . . , ?i„ be the 
(random) eigenvalues of A. The following natural questions arise. 

1 . Are there deterministic or random sets {sj}, for which one can find the exact 
distribution of {Xj}? 

2. Let Ls = Sj=i ^■^7 ^'^^ n ^ j=i ^^^7 denote the empirical measures 
of 5 = {sj} and A = {'kj}. Suppose S„ are sets of size n such that Ls^ 
converges weakly to a probability measure 9 supported on M_|_. Then, does 
La converge to a deterministic measure on the complex plane? If so, how 
is the measure /j determined by 0? 

3. For finite n, for fixed S, is La concentrated in the space of probability mea- 
sures on the plane? 

In this paper, we concentrate on the second question and answer it in the affir- 
mative, albeit with some restrictions. In this context, we note that Fyodorov and 
Wei [7 . Theorem 2.1] gave a formula for the mean eigenvalues density of A, yet in 
terms of a large sum which does not offer an easy handle on asymptotic properties 
(see also [6] for the case where D is a projection). The authors of [|7| explicitely 
state the second question as an open problem. 

Of course, questions 1-3. above are not new, and have been studied in various 
formulations. We now describe a partial and necessarily brief history of what is 
known concerning questions 1. and 2.; partial results concerning question 3. will 
be discussed elsewhere. 
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The most famous case of a positive answer to question 1 . is the Ginibre en- 
semble, see f81, and its asymetric variant, see IfTTl . (There are some pitfalls in the 
standard derivation of Ginibre's result. We refer to fT6] for a discussion.) Another 
situation is the truncation of random unitary matrices, described in f2^. 

Concerning question 2., the convergence of the empirical measure of eigen- 
values in the Ginibre ensemble (and other ensembles related to question 1.) is 
easy to deduce from the explicit formula for the joint distribution of eigenvalues. 
Generalizations of this convergence in the absence of such explicit formula, for 
matrices with iid entries, is covered under Girko 's circular law, which is described 
in BH; the circular law was proved under some conditions in [|2l and finally, in full 
generality, in [ITOll and ||22]| . Such matrices, however, do not possess the invari- 
ance properties discussed in connection of question 2. The single ring theorem 
of Feinberg and Zee f5l is, to our knowledge, the first example where a partial 
answer to this question is offered. (Various issues of convergence are glossed over 
in BH and, as it turns out, require a significant effort to overcome.) As we will see 
in Section [3l the asymptotics of the spectral measure appearing in question 2. are 
described by the Brown measure of 7?-diagonal operators. (The Brown measure is 
a continuous analogue of the spectral distribution of non-normal operators, intro- 
duced in [3J.) 7?-diagonal operators were introduced by Nica and Speicher fT8l| in 
the context of free probability; they represent the weak* -limit (or more precisely, 
the limit in ^-moments) of operators of the form UD with U unitary with size 
going to infinity and D diagonal, and were intensively studied in the last decade 
within the theory of free probability, in particular in connection with the problem 
of classifying invariant subspaces lfT2llT3l . 

2 Limiting spectral density of a non-normal matrix 

Throughout, for a probability measure jj supported on M or on C, we write G^ for 
its Stieltjes transform, that is 



G^ is analytic off the support of /j. We let iW„ denote the Haar measure on the 
n-dimensional unitary group uiji). Let {PmQn}n>\ denote a sequence of inde- 
pendent, :^„-distributed matrices. Let D„ denote a sequence of (possibly random) 

diagonal matrices with real positive entries 5„ = {sf^} on the diagonal, and intro- 
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duce the empirical measure of the symmetrized version of D„ as 




We write Go,, for G^^^^ . For a measure /j supported on M+, we write jii for its 
symmetrized version, that is, for any < a < b < 



fi{ [-a, -b] )^jj{[a,b])^ [a, b] ) . 



[H) 

Let An = PnDnQn, let A„ = {k\ '} denote the set of eigenvalues of A„, and set 



We refer to La„ as the empirical spectral distribution (BSD) of A„. Finally, for 
any matrix A, we set ||A|| to denote the £^ operator-norm of A, that is, its largest 
singular value. 

The main result of this paper is the following. 

Theorem 1. Assume {LD„}n converges weakly to a probability measure com- 
pactly supported on Assume further 

1. There exists a constant M > so that 



2. There exist a sequence of events {Qn} with P{ Q^) and constants 5, 5' > 
so that for any z G C, with the minimal singular value ofzJ —An, 





(2) 



(3) 



3. There exist constants K,K,}d > such that, for all n large. 



G^{z)-GDM\<^^y if3(z)>n- 



(4) 



4. There exists a constant Ki such that 



Gq{z)\<Ki on C+. 



(5) 
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Then the following holds. 

a. La„ converges in probability to a limiting measure /ua, rotationally invariant 
in C. 

b. The measure fJA possesses a radially -symmetric density pA with respect to 
the Lebesgue measure on C, which is characterized as follows. For z G C, 
let := 0ffl?i|2|, where = is the symmetric Bernoulli 
measure with atoms at { — |z|, |z|}, and ffl denotes free convolution. Then, 
Pa(z) = ^^z{j log \x\dv^{x)), where denotes the Laplacian with respect 
to the variable z. 

c. The support ofi^A is a single ring: there exist constants 0<a<b<oaso 
that 

supp/jA = {re'^ : a<r <b} . 
Further, a = if and only if J x^^d&{x) = 

See Remark[6]for an explicit characterization of the free convolution appearing 
in TheoremfH and lH Ch. 5] for general background. A different characterization 
of Pa, borrowed from [[TT]|. is provided next. 

Remark 2. We provide, following [ITTl Theorem 4.4 and Corollary 4.5], an alter- 
native characterization of and its support. Recall that 0({O}) = by Assump- 
tion [51 Let 0^^ denote the push forward of by the map z ^ z}, i.e. 0^'^ is the 
weak limit of {Ljfi]. Let S denote the 5-transform of 0^^ (see fTTl, Section 2] for 
the definition of the S transform of a probability measure on M and its relation to 
the R transform). Define Fif) = s{l/Vt- 1) on 2) = (0, 1]. Then, F maps © to 
the interval 

and has an analytic continuation to a neighborhood of © , and F' > on 'D . Fur- 
ther, with pA(re'^) = PA{r), it holds that 

1 0, otherwise. 
Finally, Pa has an analytic continuation to a neighborhood of {a,b]. 



5 



Theorem[T]is generalized to the case where C/„, V,, follow the Haar measure on 
the orthogonal group in Theorem [T6l 

As a corollary of Theorem [U we prove the the Feinberg-Zee "single ring the- 
orem". 

Corollary 3. Let V denote a polynomial with positive leading coefficient. Let the 
n-by-n complex matrix X„ be distributed according to the law 



where Zn is a normalization constant and dX the Lebesgue measure on n-by-n 
complex matrices. Let Lx„ be the BSD ofX^. Then {Lx„}n satisfies the conclusions 
of TheoremUlwith the unique minizer of the functional 



on the set of probability measures n on R+. 

Corollary [3] will follow by checking that the assumptions of Theorem [T] are 
satisfied for the spectral decomposition X„ = UnDnVn, see Section[6l 

The second hypothesis in Theorem [T] may seem difficult to verify in general; 
we show in the next corollary that adding a small Gaussian matrix guarantees it. 

Corollary 4. Let {Dn)n>o be a sequence of matrices satisfying the assumptions 
of Theorem\l\ except for ([3]) and assume that \\D'^^\\ is uniformly bounded. Let 
Nfi be a n X n matrix with independent (complex) Gaussian entries of zero mean 
and covariance equal identity. Let U„, V„ follow the Haar measure on unitary nxn 
matrices. Then, the empirical measure of the eigenvalues ofY„ := UnD„Vn-\-n~'^Nn 
converges weakly in probability to /ua in Theorem\l\for any y G (j, °°). 

Example 5. An exemple of sequence {Dn)n>o satisfying the hypotheses of Corol- 
lary |4] is given as follows: take /j a compactly supported probability measure 
on M+*. Assume the inverse F^^ of the distribution function F{x) = ia{[0,x]) 
is Holder continuous and that has a uniformly bounded Stieltjes transform on 
C+. Then the diagonal matrix D„ with entries 




'n 




5^ = inf{5:A/([0,5]) > -}, l<i<n, 



satisfies the hypotheses of Corollary |4l 
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The main difficulty in studying the ESD La„ is that A„ is not a normal matrix, 
that is A„A* 7^ A*A„, almost surely. For normal matrices, the limit of ESDs can 
be found by the method of moments or by the method of Stieltjes' transforms. 
For non-normal matrices, the only known method of proof is more indirect and 
follows an idea of Girko [9] that we describe now (the details are a little different 
from what is presented in Girko ||9l or Bai |l2l). 

From Green's formula, for any polynomial P{z) = n}=i ^ ^;)' have 



1 r " 

— j A\K(z)log|P(z)|Jm(z) = ^\/(X;), forany\(/GC; 



where m(-) denotes the Lebesgue measure on C. Applied to the characteristic 
polynomial of A„, this gives 

1 



Mf{z)dLA,Xz) 



Inn 



Ann 



A\|/(z) log I det(z/ -An) \dm{z) 



A\\f{z) logdet(z/ - A„) {zl - AnYdmiz) 



It will be convenient for us to introduce the 2n x 2n matrix 



Hn 



zI-A„ 

{zi~A,y 



(7) 



It may be checked easily that eigenvalues of are the positive and negative of 
the singular values of z/ — A„. Therefore, if we let denote the ESD of H^, 

/^rfv^W = ^tr((^-//„r^), 
1 1 f 

-logdet(z/-A„)(z/-A„)* = -logdet|//,^| =2 / log |x| Jv^(x) . 



then 



Thus we arrive at the formula 



j Mf{z)dLASz) = ^ j AW) 



\og\x\dyl{x) 



dm{z). 



(8) 



This is Girko 's formula in a different form and its utility lies in the following 
attack on finding the limit of La„. 
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1. Show that for (Lebesgue almost) every z G C, the measures vf, converge 
weakly in probability to a measure as n ^ 0°, and identify the limit. 
Since //^ are Hermitian matrices, there is hope of doing this by Hermitian 
techniques. 

2. Justify that J \og\x\dvf,{x) — > / \og\x\dv^{x) for (almost every) z. But for 
the fact that "log" is not a bounded function, this would have followed from 
the weak convergence of to v^. As it stands, this is the hardest technical 
part of the proof. 

3. A standard weak convergence argument is then used in order to convert the 
convergence for (almost every) z of to a convergence of integrals over z. 
Indeed, setting h{z) := J log \x\dv^{x), we will get from ([8]) that 



4. Show that h is smooth enough so that one can integrate the previous equa- 
tion by parts to get 



which identifies Ah{z) as the density (with respect to Lebesgue measure) of 
the limit of La„. 

5. Identify the function h sufficiently precisely to be able to deduce properties 
of Ah{z). In particular, show the single ring phenomenon, which states 
that the support of the limiting spectral measure is a single annulus (the 
surprising part being that it cannot consist of several disjoint annuli). 

Girko's equation ([8]) and these five steps give a general recipe for finding limiting 
spectral measures of non-normal random matrices. Whether one can overcome the 
technical difficulties depends on the model of random matrix one chooses. For the 
model of random matrices with i.i.d. entries having zero mean and finite variance, 
this has been achieved in stages by Bai |l2l, Gotze and Tikhomirov [[T0|. Pan and 
Zhou lfT9l and Tao and Vu ll22l . While we heavily borrow from that sequence, a 
major difficulty in the problem considered here is that no independence between 
entries of the matrix A„ is present here. Instead, we will rely on properties of the 




(9) 



c 




(10) 



c 
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Haar measure, and in particular on considerations borrowed from free probability 
and the so called Schwinger-Dyson (or master-loop) equations. Such equations 
were already the key to obtain fine estimates on the Stieltjes transform of Gaussian 
generalized band matrices in [,14.1 . In [4J, they were used to study the asymptotics 
of matrix models on the unitary group. Our approach combines ideas of [14ll 
to estimate Stieltjes transform and the necessary adaptations to unitary matrices 
as developped in dU. The main observation is that one can reduce attention to 
the study of the ESD of matrices of the form {T -\-U){T -\-Uy where T is real 
diagonal and U is Haar distributed. In the limit (i.e., when T and U are replaced 
by operators in a C*-algebra that are freely independent, with T bounded and self 
adjoint and U unitary), the limit ESD has been identified by Haagerup and Larsen 
[fTTI . The Schwinger-Dyson equations give both a characterization of the limit 
and, more important to us, a discrete approximation that can be used to estimate 
the discrepancy between the pre-limit ESD and its limit. These estimates play a 
crucial role in integrating the singularity of the log in Step two above, but only 
once an a-priori (polynomial) estimate on the minimal singular value has been 
obtained. The latter is deduced from assumption[3l In the context of the Feinberg- 
Zee single ring theorem, the latter assumption holds due to an adaptation of the 
analysis of [|2T1 . 

Notation 

We describe our convention concerning constants. Throughout, by the word con- 
stant we mean quantities that are independent of n (or of the complex variables 
z, zi). Generic constants denoted by the letters C,c or R, have values that may 
change from line to line, and they may depend on other parameters. Constants 
denoted by C,, K, K and k' are fixed and do not change from line to line. 

3 An auxiliary problem: evaluation of and con- 
vergence rates 

Recall from the proof sketch described above that we are interested in evaluating 
the limit of the ESD of the matrix H^, see (|7]). Note that for z 7^ 0, is also 
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the ESD of the matrix Hf^ given by 



Hi 














Qn ' 


p* 

n 








Qn ' 


p* 






Hi 



/\z\r 

zI-A„ 

zi-a: 

Pn 

q: 



Pn 
Qn 



(11) 



where W,^ = zQnPn/\z\ is unitary and ^n distributed. We are thus led to the study 
of the ESD for a sequence of matrices of the form 



5„ 
B: 



(12) 



with Bn = Un + Tn, Tn being a real, diagonal matrix, and U„ a fH'n unitary matrix. 
We denote in short 



Un 





u* 





u: 



T„ 
Tn 



(13) 



3.1 Limit equations 

We begin by deriving the limiting Schwinger-Dyson equations for the ESD of 
Y„ . Throughout this subsection, we consider a non-commutative probability space 
(j^,*,^) on which variable a variable U lives and where ^ is a tracial state satis- 
fying the relations n{{UU* - 1)^) = 0, niV) = for a e Z\ {0}. is the unique 
non-commutative law of bounded variables which is invariant under unitary con- 
juguation, and therefore corresponds to the asymptotics of the Haar measure. In 
the sequel, 1 will denote the identity in ^ . We refer to UJ Section 5.2] for defini- 
tions. 

Let r be a self-adjoint (bounded) element in , with T freely independent with 
U . Recall the non-commutative derivative 3, defined on elements of C(r, U , U*) 
as satisfying the Leibniz rules 

dPQ = dPxl®Q + P®\xdQ, (14) 

dU = U ®\,dU* = -l®U\dT = 0(S>0. 
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(Here, ® denotes the tensor product and we write A ®5 xC^D = {AC) ® (BD).) 
d is defined so that for any 5 G ^ so that B* = -B, any P eC{U,U*,T), 



P{Ue'^'^,e-'^^U*,T) = P{U, U\T) +edP{U, U\T)^B + o(£) , 

where we used the notation A ® 5|jC = ACB. 

By the invariance of /j under unitary conjuguation, see [|24l Proposition 5.17] 
or |[Ti (5.4.31)], we have the Schwinger-Dyson equation 

^®^{dP)=Q. (15) 

We continue to use the notation Y, U, U* and T in a way similar to (fT2l) and 
([13]). So, we let Y = U + U* + T with 

Hy^)' -'-{v'l)' '-[ri)- 

We extend jji to the algebra generated by U, U* and T by putting for any A, B,C,D& 

Observe that this extension is still tracial. 

The non-commutative derivative d extends naturally to the algebra generated 
by the matrix- valued U,U*, T, using the Leibniz rule (fT4l) together with the rela- 
tions 

au-u®p, au* = -p(g)U*, (i?) 



where we denoted p = 




In the sequel we shall apply d to analytic 



functions of U + U* and T such as products of Stieltjes functional of the form 
(z — bV — bV* — aT)^^ with z E C\]R and a, G M. Such an extension is straight- 
forward; d continues to satisfy the Leibniz rule and 

d{z-bl]-bU*-aTy^ = 

b{z-bV-bV* -aTy\v®p-p®V*){z-bV-bV* -aT)~^ 
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Introduce the notation, for zi,Z2^ C+, 

G(zi,Z2) = A'((^i-Y)-i(z2-T)-i) , 

Gu{zi,Z2) = A^(U(zi-Y)-i(z2-T)-i) , 

Guizi) = A'(U(zi-Y)-i) , 

Gu*{zi,Z2) = A'(U*(zi-Y)-i(z2-T)-i), (18) 

Gt{zi,Z2) = A'(T(zi-Y)-ife-T)-i) , 

= A'((zi-Y)-% 

Gt{z2) = A'(fe-T)-i). 

We apply the derivative d to the analytic function P = (zi — Y)^^(z2 — T)^^U, 
while noticing that, by (O and (flTI) . 

= p ® + (21 - Y)-iU ® - (zi - Y)- V ® UV. (19) 
For any smooth function Q, 

^i{V*QV)=^i{{l-p)Q) 

due to the traciality of /j and UU* = 1— p. Further, Pp = P and thus i-i{pP) = /jiP), 
and by symmetry (note that (1 — p){zi — Y)^^(z2 — T)^^ and p{zi — Y)^^(z2 — 
T)^^ are given by the same formula up to replacing [U ,U*) by {U*,U), which 
has the same law) 

^i{{\-p){zl-Y)-\z2-T)-') = \^,{{zl-Y)-\z2-T)-'). 

The same equality holds without the last factor (z2 — T)^\ and so we get from 

-G(/(zi,Z2) = -Gu{zi,Z2)Gu{z\) + -G{z\,Z2)G{z\) . (20) 
Noticing that Gu{z\) is the limit of Z2Gu{z\-,Z2) as Z2 °°, we find by (|20l) that 

^-Gij{zi) = -Gu{zif + \G{zi)\ 
and therefore, as Gu{z\) goes to zero as z\ — > °°, 

= ^(-^ + + = ^(-1 + v/l+4G(zi)2) . (21) 
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Here, the choice of the branch of the square root is determined by the expansion 
of Gu{z) at infinity and the fact that both G{z) and Gu{z) are analytic in C+. This 
equation is then true for all zi G C+. 
Moreover, by (|20|) and (|2TI) . we get 



^ / N 1 G{zhZ2)G{zi) G{zi,Z2)G{zi) 

Gu{ZuZ2) = :z-—-r-—-- = -====. (22) 

2 l+2G(/(zi) l + Vl+4G(zi)2 

(Again, here and in the rest of this subsection, the proper branch of the square root 
is determined by analyticity.) Let R denote the i?-transform of the Bernoulli law 

li := (5_i+5+i)/2, that is, 

, Vl+4z2-i 2z 



2z v^rr4?+r 

see [dl Definition 5.3.22 and Exercise 5.3.27], so that we have 

Gu{zuZ2) = ^G{zuZ2)R{G{zi)) . (23) 

Repeating the computation with Gu*, we have Gu* = Gy. Algebraic manipula- 
tions yield 

Gr (21,22) = Z2G{zuZ2) -G{zi) , (24) 

2Gu{zuZ2) +Gt{zi,Z2) = ZiG{zi,Z2) -Gt{z2) ■ (25) 
Therefore, we get by substituting (|23l) and (|24|) into (l25l) that 

G{zuZ2)RiG{zi)) +Z2G{ZUZ2) - G{zi) = ZlG(zi,Z2) - Gt{z2) , (26) 
which in turns gives, for any zi,Z2 ^ C+, 

G(zi,Z2) +Z2-zi)= G(zi) - Gr(z2) • (27) 

Thus, 

Gr(z2) = G(zi) when z2 = zi-^(G(zi)). (28) 



The choice of Z2 as in (|28l) is allowed for any zi G C+ because G : C+ ^ and 
we can see that : C~ — *• C~. Thus 3z2 > 3zi > 0, implying that such Z2 belongs 
to the domain of Gj. 

The relation (|28] ) is the Sch winger-Dyson equation in our setup. It gives an 
implicit equation for G(-) in terms of Gt{-)- Further, for z with large modulus. 
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G{z) is small and thus z i— > z — i?(G(z)) possesses a non-vanishing derivative, and 
further is close to z. Because Gt is analytic in the upper half plane and its deriva- 
tive behaves like 1/z^ at infinity, it follows by the implicit function theorem that 
(l28l) uniquely determines G(-) in a neighborhood of °°. By analyticity, it thus fixes 
G(-) in the upper half plane (and in fact, everywhere except in a compact subset 
of M), and thus determines uniquely the law of Y. 

Remark 6. Let /jt denote the spectral measure of T, that is j fd/Jr = KfiT)) 
for any / G Ci,(M) . We emphasize that Gt is not the Stieltjes transform of the law 
of T; rather, it is the Stieltjes transform of the symmetrized version of the law of 
T, that is of the probability measure fix- With this convention, (|28l ) is equivalent 
to the statement that the law of Y, denoted /jy, equals the free convolution of fir 
and i.e. /jy =/irffl?ii, where "k^ = i^-\z\ +S|z|)/2 is the Bernoulli law that puts 
mass J at ±|z|. 

In the next section, we will need the following estimate. 

Lemma?. //|Gr(-)l < Ki on C+ then |G(-)| < Ki on C+. 

Proof Recall that if z G C+ then G(z) G and also R{G{z)) G because R 
maps into (regardless of the branch of the square root taken at each point). 
Thus,y = z-R{G{z))eC+. Therefore, \G{z)\ = |Gr(y)| <Ki. □ 

3.2 Finite n equations and convergence 

We next turn to the evaluation of the law of Y„. We assume throughout that the 
sequence r„ is uniformly bounded by some constant M, that Lj^ fjj weakly 
in probability, and further that dH) and ^ are satisfied with T„ and the spectral 
distribution of T replacing D„ and 0. Recall first, see [T, (5.4.29)], that by in- 
variance of the Haar measure under unitary conjuguation, with P G C(r, U, U*) a 
noncommutative polynomial (or a product of Stieltjes functional), 

£[ltr®;itr(aP(T„,U„,U:))] =0. (29) 
2n 2n 

This key equality can be proved by noticing that for any nx n matrix B such that 
B* = -B, for any {k,i) G [l,n], if we let Un{t) = Unc'^ and construct Un(0 and 
Un*(0 with this unitary matrix, 

o = a,£[(/'(T„,u„(o,u:(0)y =£[(a/'(T„,u„,u:)ttB),^^] oo) 
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with B 




B 



Letting A(fc,£) be the n x n matrix so that A(fc,£),- ; 



we can choose in the last equality 5 = A(fc, £) — A(£,fc) oxB = i{A{k,£) +A{£,k)). 
Summing the two resulting equalities and then summing over k and i yields (|29l) . 

We denote by G" the quantities as defined in (fTSi) . but with £'[^tr] replacing /u 
and the subscript n attached to all variables, so that for instance 



We get by taking P = {zi-Yn)-\z2 -Tn)-^Vn that 

^GUzuZ2) = -Glj{zuZ2)G'lj{zi) + ^G''{zuZ2)G"{zi) + 0{n,zuZ2), (31) 



with 

0{n,zi,Z2) =E 



1 



1 



1 



1 



(-tr-£[-tr])®(-tr-£[-tr])a(zi-Y„)-^(z2-T„)-^U 



'2n 



2n 



In 



In 



Further, by the standard concentration inequality for i//„, see flU Corollary 4.4.31], 
for any smooth function P : 'U{n) ^ C, 

21 



lT(P)-£ll,r](p; 



with \\P\\l the Lipschitz constant of P given by 

II^IIl=II^^IIc 



1 n n? 
< - Pi, 

n^ 



(32) 



if D is the cyclic derivative given by D = mod with m(A (8)5) = BA. Applying 
(|32|) to each term of dP (recall formula (HH)), we get that for 3(zi) > 0, 



|0(«,Zl,Z2)| < 



c 



n2|3(z2)|3(zi)2(3(zi)Al)- 
Multiplying by Z2 and taking the limit as Z2^ °° we deduce from (|3TI) that 



(G"(zi))2 = 2G'^(zi)(l +2G"^(zi)) - Oi(n,zO 



(33) 



where 

Oi(n,zi) 



4E 
O 



(ltr-£[;^tr])®(itr-£[^tr])a(zi-Y.)-iU. 



1 



1 

2^" 



1 



1 



n23(zi)2(3(zi)Ai; 
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In particular, 



G'h{zi)=^-{-l + y/l+4G"(zi)2 + 40i(n,zi)) , (34) 

with again the choice of the square root determined by analyticity. 

Recalling that (l24l) and (|25l) remain true when we add the subscript n and 
combining (|3T1) and (|24|) . we get 

<^"(2i'22)f7rT?7l^+22-^il =G"(zi)-G^(z2) + 0(n,zi,Z2), (35) 



(l+2G'^(zi)) 



with 



~ 2C>(n,zi,z2) 
0(n,zi,Z2) 



(l + 2G'^(zi)) 



Hence, if we define 



then 

and therefore 



Z2 = ^K„(zi):=zi-^^:^^, (36) 



G"{zx) = G"j{z2) + d{n,zx.Z2) 



G"(zi) =G^(i|/„(zi))+0(n,zi,l^,(zi)). (37) 

Equation (|37l) holds at least when 3z2 > for zi as in (l36l) . In particular, for 3 (zi ) 
large (say larger than some M), it holds that G" {z\ ) and G'^ (zi ) are small, implying 
that zi is well defined with 3(z2) > 0- Assume Lj^^ converges towards Lj so that 
Gj converges to Gj on C+. Then, the limit points of the sequence of uniformly 
continuous functions (G"(z), G'^(z)) on {z : 3z > M} satisfy dlB and dlH]) and 
therefore equal {G{z):Gu{z)) on {z : 3z > M} by uniqueness of the solutions to 
these equations. Hence, taking n^oo then implies that G" ^ G in a neighborhood 
in the upper half plane close to oo. Since G" and G are Stieltjes transforms of 
probability measures, we have now shown the following (see Remark[6l). 

Lemma 8. Assume Lj^ converges weakly in probability to a compactly supported 
probability measure j-tj. Then, Ly„ converges weakly, in probability, to /vy = //j ffl 
Xi. In particular, ifLD„ converges weakly in probability to a probability measure 
0, then for any z G C, vj; converges weakly in probability to ffl Xui. 
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(Recall that is the symmetrized version of and note that for z = 0, the state- 
ment of the lemma is trivial.) 

Lemma [8] completes the proof of Step one in our program. To be able to 
complete Step two, we need to obtain quantitative information from the (finite n) 
Schwinger-Dyson equations (|37]) : our goal is to show that the left side remains 
bounded in a domain of the form {z E C+ : 3z > n^*^} for some c > 0. Toward 
this end, we will show that in such a region, \(/„ is analytic, 3v[^n(z) > 3z/2 AC 
for some constant C and 0(n,zi,\|/„(zi)) is analytic and bounded there. This will 
imply that (|37l) extends by analyticity to this region, and our assumption on the 



boundedness of will lead to the conclusion. 

As a preliminary step, note that G"(-) and G^(-) are analytic in C+. We have 
the following. 

Lemma 9. There exist constants Ci,C2 such that for all z G C+ with 3(z) > 
Cin^^/^ and all n large, it holds that 

|1+2G'^(z)|>C2[3(z)3a1]. (38) 

Proof Since G'lj{z) is asymptotic to 1/z at infinity, we may and will restrict atten- 
tion to some fixed ball Br C C, whose interior contains the support of Y. But 



3(G«(z)) = -3(z) J ■ 



dnY„{x) 



(9t(z)-x)2 + 3(z)2 
and therefore, as (91 (z) -xY + Z{zf< AR^ for all z,x G 5(0, i?) 

|G"(z)|>|3(G"(z))|>^. (39) 

Moreover, since |G'^(z) | < l/|3(z) |, for some constant c independent of n and all 
n large, we deduce from (l33l) that 



|G"(z)P<^l±^ + . 



|3(z)| n23(z)2(3(z)Al)' 
Combining this estimate and (|39l) . we get that 

2|l+2G'-(z)| |3(z)|2 c |3(z)|^ 

|3(z)| - 167?4 n23(z)2(3(z)Al) - 32i?4 ' ^ ^ 

as soon as 3(z) > C\n^^/^ for an appropriate Ci, and |z| < R. The conclusion 
follows. 
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□ 

As a consequence of Lemma |9] and the analyticity of G" and in C"*", we 
conclude that \|/„ is analytic in {z : 3(z) > Cin^^/^}, for all n large. 

Our next goal is to check the analyticity of z ^ (9(n,z,\|/„(z)) for z E C+ with 
imaginary part bounded away from by a polynomially decaying (in n) factor. 
Toward this end, we now verify that V[;„ (z) G C+ for z up to a small distance from 
the real axis. 

Lemma 10. There exists a constant C3 such that if^ [z) > C^n^^^^, then 3 {'y\in (z) ) > 
3(z)/2. 

Proof Again, we may and will restrict attention to 3 (z) < ^ for some fixed R. We 
divide the proof to two cases, as follows. Let e„ = n^^/^, and set A„ = {z G C+ : 
|G"(z)+//2|>e„}. 

Then, for any z G A;^, and whatever choice of branch of the square root made in 

— 1/2 

dSl), ife,/^ Oi(n,z) is small enough (smaller than e„/2 is fine), then that choice 
can be extended to include a neighborhood of the point w = G" (z) such that with 
this choice, the function r{w) = — i + vTT4m?) is Lipschitz in the sense that 



|G^(z)-r(G"(z))|<Ce.^Oi(n,z) 
On the other hand, again from (|34l) . 



(41) 



G"(z) 2G"^(z) 



l+2G"(z) G«(z) 



<C 



|Oi(n,z) 



G«(z)(l+2G"^(z))| 



Combining the last display with the relation R{Q) = 2r(e)/0, (gT]) and ([391), one 
obtains that for z G A„, 



G"(z) 



l+2G"^{z 



-i?(G"(z)) 



< 



2r(G"(z)) 2G^(z) 



< C 



< 



G"(z) G"(z) 

\Oi{n,z)\ 
G«(z)(l+2G'^(z))| 

|Oi(n,z) 



+ 



G"(z) 



+ C 



l+2G^(z) 
Oi(n,z)| 



u^^"^ e,?|G"(z) 
Oiin,z)\ 



< 



< 




(42) 
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Since the above right hand side is smaller than 3(z)/2 for 3(z) > n and 
3(i?(G"(z))) < 0, we conclude that for z G A„ n {3(z) > n^^/^} 



l+2G'^(z); - 2 

as, regardless of the branch taken in the definition of R{-), 3^(G"(z)) < 0. 

On the other hand, when z G C+ \ A„ and 3(z) > n^^/^, then we have from 
(|34|)that 

\G1j{z) + 1/4| < iVe. + |Oi(n,z)|. 
Thus, under these conditions, 

^ 23G"(.)+3^ 



l+2G«(z); Vl+4(G"^« + l/4), 

< 3(G"(z))+Cv/e„ + |0i(n,z)| < +Cv/e„ + |Oi(n,z)| 

< Cn-'/\ 

where we finally used that on A„, G"(z) is uniformly bounded and so that 3(G"(z)) < 
— 1/4 for z G C+ and n>2. We thus conclude from the last display and (l42l) the 
existence of a constant C3 such that if 3(z) > C^n^^^'^ then 

3(v„(z)) = 3(z)-3(^^gl^)>3W/2. 

as claimed. □ 
From Lemma [TOl we thus conclude the analyticity of z — (9(«,z,\|/„(z)) in 

{z : 3(z) > C3n-i/4}, and thus G"(z)/(1 + 2G'^(z)) is also analytic there. In 

particular, the equality (l37l) extends by analyticity to this region. 

We have made all preparatory steps in order to state the main result of this 

subsection. 

Lemma 11. There exist positive finite constants C(„Cj,C^ such that, for n> 
and allzE "En := {z : 3(z) > n"^^}, 

\G'\z)\<C^. (43) 

Proof This is immediate from Lemma [9l Lemma [lOl the definition of the 
assumption of Gj and the equality (l37l) . □ 
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4 Tail estimates for 

Our goal in this short section is to prove the following proposition. 
Proposition 12. (i) Fix z G C. Under the assumptions ofTheoremU} 

limlimsup£'[l^ / log Jv^(;c)] = 0. (44) 

ejO n^oo Jq 

Consequently, for any z G C, 

. '°8kMv;W^/logH.vH.), (45) 
in probability. 

(ii) For any smooth compactly supported deterministic function (p on C, 

(p(z) j \og\x\dvl{x)dm{z) ^ j cp(z) j \og\x\d'V^{x)dm{z) , (46) 
in probability. 

Before bringing the proof of Proposition [121 we recall the following elemen- 
tary lemma. 

Lemma 13. Let /ube a probability measure on M. For any real y > 0, it holds that 

^ii-y,y))<2y\Giiy)\. (47) 

Proof We have 

-3{G{iy))= I y^^(dx)> r y^^(dx) > ^iua-y,y)) , 

J y^+x-" J-yy 2y 

from which (l47l) follows. □ 

We can now provide the 
Proof of Proposition HH 

Let R be large enough so that Br C C contains the support of (p. Throughout 
this proof, we may and will restrict attention to z satisfying |z| < R. 

(i) By dl]), we can replace the lower limit of integration in (|44|) with n^^. Let 
denote the Stieltjes transform of £'[v^]. For z 7^ 0, by Lemma [TT] and Lemma 
U\ there exist positive constants ci{R),C2(R) such that whenever 3(m) > n^^'^, it 
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holds that |G^(m)| < C2- For z = 0, L|] is simply the BSD of D„, once symmetrized, 
and hence G|J is also uniformly bounded by dH) and ([5]). 

Therefore, since Gf, is the Stieltjes transform of £'[v^], by Lemma [T3l for any 

y>0, 

E[vl{{-y,y))] < £[vf,((-j Vn-^'))] < 2c2j Vn"^' . 

Thus, we get that for any zE Br and with a G [1,2], 



j\\ogx\rdviix)] 

< E[ {\logx\rdvl{x) + (\logx\rdwl{x)] 

< ((ciV5)logn)«£[v^„((-n-^-',n--i))] 

J 

+ 5^£[v^;((-2(^+i)n-^->,2(^+i)n-^'))](log(2V^'))", 

where < e < 2-'n^'^''. Note that by Lemma [T3] and the estimate on G; 

for j > 0, 



It follows that 



E[ |logx|«Jvf,W]<Ce|log(£)|«, 



(48) 



where the constant C does not depend on z. The estimate (l44l) follows when 
considering a = 1 . 

Moreover, by ©, for a < 2, 



£[1^„ / \\ogx\W,{x) 



< E[lgX{[-n-\n-^])^^^^^-,^\\og<5: 



Z IK] 
nl J 



2-a 



< £ 



V^,([-^-^^-^]) 



}|loga^l¥ 
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by Holder's inequality. The first factor goes to zero because 



-8 . -8n \ 2-a 



Vl{[-n-\n-']) 



<E 



< 2c2n-\ 



We thus get (l44l) from (|48] ). By Chebychev's inequality, the convergence in expec- 
tation implies the convergence in probability and therefore for any 6, 6' > there 
exists £ > small enough so that 



\imP{[ \\ogx\dv^„{x) > d) < d' 



On the other hand, \og\x\dv\{x) converges to j^\og\x\dy^{x) by the weak 
convergence of vj; to in probability for any e > 0. Hence, we get (l45l) . 
(ii) Define the functions f^^: Br ^M., i = 1,2 by 

fniz) = igA\\D„\\<M / ^og{x)d\l{x), 

Jo 

poo 

fniz) = ig,A\\D„\\<M / l0g(jc)Jvf,W, 

and set /„ (z) = (z) + (z) . Because is supported in Br+m on \\Dn\\ <M, fn 
is bounded above. By (l48l) . is bounded, uniformly in z G Br. On the 

other hand, by ([3]), again uniformly in z, E{f,l (z)^) < 5', and therefore 



E [ (/„i(z))2jm(z)<oo. 



Thus, E Jg^ \fn{z)\^dm{z) < °°, and in particular, the sequence of random variables 



lt?„ll|o||„<M / logxdv^ix 



dm{z) 



is bounded in probability. This uniform integrability and the weak convergence 
(l45l) are enough to conclude, using dominated convergence (see |l23l Lemma 3.1] 
for a similar argument). □ 
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5 Proof of Theorem [T] 



By Proposition [T2l see (|46l) . we have, with h{z) '■= J log\x\d\^{x), that for any 
smooth compactly supported function \|/ on C, 

j \\f{z)dLA„iz) ^V(z) h{z)dm{z) , 

c 

in probability. Since the sequence La„ is tight, it thus follows that it converges, in 
the sense of distribution, to 

From Remark [2] (based on (TV, Corollary 4.5]), we have that the limit is actually a 
function. The statement of the theorem follows. □ 



6 Proof of Corollary |3l 

We let Xn be as in the statement of the corollary and write X„ = PnDnQn with 
Pn.Qn unitary and D„ diagonal with entries equal to the singular values {a"} of 
Xn. Obviously, {P,,, Qn}n>i is a sequence of independent, j^„-distributed matrices. 
The joint distribution of the entries of D„ possesses a density on which is given 
by the expression 

i<j i 

where Z„ is a normalization factor, see e.g. [[B Proposition 4.1.3]. Therefore, the 
squares of the singular values possess the joint density 

i<j i 

on M" . In particular, it falls within the framework treated in fl^. By part (i) of 
Theorem 2.1 there, there exist positive constants M,Cii such that /'(oi >M— 1) < 
g Ciin ^ ^j^^ i^j^jjg point 1 of the assumptions of Theorem [T]holds. Point 3 of the as- 
sumptions (with K < 1/4 and k' = 1/2) is an immediate consequence of equation 
(2.26) there. Point 4 of the assumptions is an immediate consequence of equation 
(2.32) there. Thus, it remains only to check point 2 of the assumptions. Toward 
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this end, define = < M + 1 } and note that we may and will restrict atten- 
tion to |z| < M + 2 when checking ([3]). We begin with the following proposition, 
due to [lITTl . 

Proposition 14. Let A be an arbitrary n-by-n matrix, and let A = A + gN where 
N is a matrix with independent (complex) Gaussian entries of zero mean and unit 
variances. Let o„(A) denote the minimal singular value of A. Then, there exists a 
constant Cn independent of A, o or n such that 

P(o„(A)<x)<Ci2v^(^)'. (49) 

The proof of Proposition[T4lis identical to [|2T1 Theorem 3.3], with the required 
adaptation in moving from real to complex entries. We omit further details. 

On the event g„, all entries of the matrix X„ are bounded by a constant multiple 
of y/n. Let N„ be a Gaussian matrix as in Proposition [141 With a > 2 a constant 
to be determined below, set 

g'„ = {all entries of n^^/^Nn are bounded by 1 } . 

Note that because a > 2, on g'^, we have that Oi {n^'^Nn) < 1 . Define A„ = zl—Xn, 
An =An+ n^^Nnlg'^ and An = An + n^^Nn- Then, by (l49l) . with o,, (A„) denoting 
the minimal singular value of A„, we have 

P{<5n{An) < X- gn) < Cnx^n''^^^'' . (50) 

If the estimate (l50l) concerned A„ instead of An , it would have been straightforward 
to check that point 2 of the assumptions of Theorem[T]holds (with an appropriately 
chosen 5, which would depend on a). Our goal is thus to replace, in (|50l) . A„ by 
An, at the expense of not too severe degradation in the right side. This will be 
achieved in two steps: first, we will replace A„ by A„, and then we will construct 
on the same probability space the matrix X„ and a matrix 7,, so that y„ is distributed 
like Xn + n-'^Nnlgi^ but P{Yn ^ Xn) is small. 

Turning to the construction, observe first that from (|50l ). 

P{<5n{An)<x-gn)<Cnx\''^+^''+P{{g:,r)<Cn[x^^^^^ 

(51) 

(a) 

Let Xn =Xn + n^'^Nnlg^. Let {0/} and {/j/} denote the eigenvalues of Wn ~ 

XnX* and of Wn"'^ = {X,["'^){Xn"'^)* , respectively, arranged in decreasing order. 
Note that the density of X„ is of the form 

2-ig-ntr(nxx*))^^^ 
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where the variable x = {xi,j}i<i,j<n is matrix valued and dx = Y[i<i j<n^^i,j^ 

(a) — ' — 

while that of X„ ^ is of the form 

where En denotes expectation with respect to the law of A^,,, and Z„ is the same 

foe) 

in both expressions. Note that ai(X,rO e [c>i(X,0 - l,Oi(X„) + 1]. Because V{-) 

(a) 

is locally Lipschitz, we have that if either Oi <M+ \ or Oi 
then there exists a constant C13 independent of a so that 

n n 

|tr(VM-y(wi«)))| < J2\Vm-V{l^d\<Ci3j2\Q-p,\ 

i=l (=1 

< Cun'/^(^^\Q,-^i,\^^ (52) 

< Ci3n^/2""l^;;tr((n-«/2A^„)2)i/2 < „Ci4-a^ (53) 

where the Cauchy-Schwarz inequality was used in the third inequality and the 
Hoffman-Wielandt inequality in the next (see e.g. [[Tl Lemma 2.1.19]). We em- 
phasize that the constant C14 does not depend on a. In particular, if a > (C14 + 
1) V2 we obtain that on the ratio of the functions /„ = e""tr(i/(iv;,)) 

gn — e-"tr(v(w„ )) |g bounded e.g. by 1 +n^i'*+i-o'- particular, it holds that 

P(oi(xi"^) <M) < (l+n^i4+i-a)p(c>i(X„) <M) 

< {l+n^''+^-'')^P{oi{xi"-^) <M). 

Therefore, the variational distance between the law of X„ conditioned on Oi (X„) < 

fot") fa) 
M and that of X^, conditioned on Oi (X„ ') < M, is bounded by 

4„Ci4+l-a_ 

fot) 

It follows that one can construct a matrix of law identical to the law of X„ 
conditioned on ai{X^) < M, together with X„, on the same probability space so 
that 
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Note that this estimate does not depend on z- Combined with (|5T1) . we thus deduce 
that 

where a was chosen as function of x. This yields immediately point 2 of the 
assumptions of Theorem[Tl if 5 > SCn/l. 

We have checked now that in the setup of Corollary [3l all the assumptions 
of Theorem [T] hold. Applying now the latter theorem completes the proof of the 
corollary. □ 

Remark 15. The proof of Corollary [3] carries over to more general situations; 
indeed, V does not need to be a polynomial, it is enough that its growth at infinity 
is polynomial and that it is locally Lipschitz, so that the results of ll20l still apply. 
We omit further details. 



7 Proof of Corollary a 

We take D„ satisfying the assumptions of Corollary |4] and consider y„ = UnDnVn + 
n~'^Nn, with matrix of singular values D„. Note that y„ = UnDnVn with f7„,y„ 
following the Haar measure. We first show that D,, also satisfies the assumptions 
of Theorem [H when y > ^, except for the second one. Since the singular values of 
Nn follows the joint density of Corollary [3] with V{x) = ^x^, it follows from the 
previous section that i'(||n~5A/'„|| > M) < g^*^!!" and therefore \\Dn\\ < ||Ai|| + 
n^'>'+2 \\n^2Nn II is bounded with overwhelming probability. Moreover, since D„ = 
\Dn + n-m:NnV:\, 

\r (.\ r ,_M^ -g[llA.-A.||] ^ C(||D-i||) ,^i_, 

with C(||D,7^||) a finite constant depending only on ||£>7^|| which we assumed 
bounded. As a consequence, the third condition is satisfied since 

Ir M r Ml<a^J-Y^ K K' 



|3z|2 ri^lZzl- ni\Zz\ 

with y = min{K, ^(y- \)} and 3z > n^^^^^h^'^^i)--'^) . Hence, the resuks of 
Lemma [TT] hold and we need only to check as in Proposition [121 that, if the 
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singular values of zl — ¥„, 



In:=E[\g„ / \og\x\dvl{x)] 
Jo 



vanishes as n goes to infinity for some 5 > and some set g„ with overwhelming 
probability. But A„ = z/ — = zl — U„D„V„ + n^^Nn with Nn a Gaussian matrix, 
and therefore we can use Proposition [14] to obtain (|49l ) with o = n and the 
desired estimate on /„. □ 
Proof of Example |5] 

Lideed, the first and the fourth hypotheses of Theorem [T] are verified since ^ 
is compactly supported and we assumed its symmetrized version has a bounded 
Stieltjes transform. For the third, note that if is Holder continuous with index 
a, 

n \ n _ n\ n ( i±l\ _ p-\ f -a 



(=1 (=1 

1 ; 



2 



where we finally used that F is Holder continuous with index a. □ 



8 Extension to orthogonal conjuguation 

In this last section, we generalize Theorem[I]to the case where we conjuguate D„ 
by orthogonal matrices instead of unitary matrices. 

Theorem 16. Let D,i be a sequence of diagonal matrices satisfying the assump- 
tions of Theorem\I} Let On, On be two nx n independent matrices which follow 
the Haar measure on the orthogonal group and set An = OnDnOn- Then, La„ 
converges in probability to the probability measure jUA described in TheoremUl 

Proof. To prove the theorem, it is enough, following Section [51 to prove the 
analogue of Lemma \TT\ which in turn is based on the approximate Schwinger- 
Dyson equation ([35] ) which is itself a consequence of equation ([291) and concen- 
tration inequalities. To prove the analogue of ([29[) when Un follows the Haar 
measure on the orthogonal group, observe that ([30l) remains true with B' = —B 
which only leaves the choice B = A(fc, £) — A(£,fc) possible. However, taking this 
choice and summing over k, £, yields, if we denote m{A®B) = AB', 

£[ltr® ltr(a/'(T„,U„,U:))] = ^E[^tri{modP){TM,,V:))]. 
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The right hand side is small as modP is uniformly bounded. In fact, taking 
P = (zi — Y„)^^(z2 — T„)^^U„, we find that modP is uniformly bounded by 
2/(|3z2|(|3zi| A 1)2) and therefore (EB holds once we add to 0{n, 11,12) the 
above right hand side which is at most of order l/n|3z2|(|3zi| A 1)^. Since our 
arguments did not require a very fine control on the error term, we see that this 
change will not affect them. Since concentration inequalities also hold under the 
Haar measure on the orthogonal group, see [1 , Theorem 4.4.27] and [[B Corollary 
4.4.28], all the proof of Theorem[T]can be adapted to this set up. □ 
Acknowledgments: We thank Greg Anderson for many fruitful and encour- 
aging discussions. We thank Yan Fyodorov for pointing out the paper [fTSl . 
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